rank | frequency | n-gram |
---|---|---|
1 | 9131 | -ς |
2 | 4872 | -ν |
3 | 4052 | -α |
4 | 2834 | -ι |
5 | 2095 | -η |
rank | frequency | n-gram |
---|---|---|
1 | 1463 | -ου |
2 | 1452 | -ας |
3 | 1310 | -ος |
4 | 1211 | -ες |
5 | 1167 | -αν |
rank | frequency | n-gram |
---|---|---|
1 | 808 | -ται |
2 | 730 | -ική |
3 | 568 | -ικό |
4 | 540 | -ίας |
5 | 520 | -ηκε |
rank | frequency | n-gram |
---|---|---|
1 | 369 | -εται |
2 | 343 | -ικής |
3 | 331 | -ικές |
4 | 328 | -θηκε |
5 | 313 | -νται |
rank | frequency | n-gram |
---|---|---|
1 | 221 | -ονται |
2 | 184 | -οντας |
3 | 164 | -θηκαν |
4 | 145 | -τικές |
5 | 138 | -ματος |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings